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Econometrica, Vol. 37, No. 3 (July, 1969) 


INVESTIGATING CAUSAL RELATIONS BY ECONOMETRIC MODELS 
AND CROSS-SPECTRAL METHODS 


By C. W. J. GRANGER 


There occurs on some occasions a difficulty in deciding the direction of causality between 
two related variables and also whether or not feedback is occurring. Testable definitions 
of causality and feedback are proposed and illustrated by use of simple two-variable models. 
The important problem of apparent instantaneous causality is discussed and it is suggested 
that the problem often arises due to slowness in recording information or because a 
sufficiently wide class of possible causal variables has not been used. It can be shown that 
the cross spectrum between two variables can be decomposed into two parts, each relating 
to a single causal arm of a feedback situation. Measures of causal lag and causal strength 
can then be constructed. A generalisation of this result with the partial cross spectrum 
is suggested. 


1. INTRODUCTION 


THE OBJECT of this paper is to throw light on the relationships between certain 
classes of econometric models involving feedback and the functions arising in 
spectral analysis, particularly the cross spectrum and the partial cross spectrum. 
Causality and feedback are here defined in an explicit and testable fashion. It is 
shown that in the two-variable case the feedback mechanism can be broken down 
into two causal relations and that the cross spectrum can be considered as the 
sum of two cross spectra, each closely connected with one of the causations. 
The next three sections of the paper briefly introduce those aspects of spectral 
methods, model building, and causality which are required later. Section 5 presents 
the results for the two-variable case and Section 6 generalises these results for 
three variables. 


2. SPECTRAL METHODS 


If X, is a stationary time series with mean zero, there are two basic spectral 
representations associated with the series: 
(i) the Cramer representation, 


2.1) X,= f et dz (w), 


where z,(w) is a complex random process with uncorrelated increments so that 
(2.2) E[dz,(a@) dz,(A)] = 9, w +À, 


(ii) the spectral representation of the covariance sequence 
23 w= EIX XJ = | e dRo). 


424 
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If X, has no strictly periodic components, dF,(w) = f,(w)da@ where f,(w) is 
the power spectrum of X,. The estimation and interpretation of power spectra 
have been discussed in [4] and [5]. The basic idea underlying the two spectral 
representations is that the series can be decomposed as a sum (i.e. integral) of 
uncorrelated components, each associated with a particular frequency. It follows 
that the variance of the series is equal to the sum of the variances of the components. 
The power spectrum records the variances of the components as a function of 
their frequencies and indicates the relative importance of the components in 
terms of their contribution to the overall variance. 

If X, and Y, are a pair of stationary time series, so that Y, has the spectrum /,(@) 
and Cramer representation 


=| dzo), 


then the cross spectrum (strictly power cross spectrum) Cr(w) between X, and Y, 
is a complex function of w and arises both from 


E{dz,(@) dz,(a@)] = 0, wd, 
= Criw)da, w =À, 


and 
Va FX =| = f ee Cr(w) dw. 


It follows that the relationship between two series can be expressed only in terms 
of the relationships between corresponding frequency components. 

Two further functions are defined from the cross spectrum as being more useful 
for interpreting relationships between variables: 

(1) the coherence, 


Co)? 
DA ay 


which is essentially the square of the correlation coefficient between corresponding 
frequency components of X, and Y,, and 
(ii) the phase, 


$(e) n- 1 maginary part of Cr(q@) 
w) = 2 aaas 


2 


real part of Cr(q) 


which measures the phase difference between corresponding frequency components. 
When one variable is leading the other, 6(~)/w measure the extent of the time lag. 
Thus, the coherence is used to measure the degree to which two series are 
related and the phase may be interpreted in terms of time lags. 
Estimation and interpretation of the coherence and phase function are discussed 
in [4, Chapters 5 and 6]. It is worth noting that é(w) has been found to be robust 
under changes in the stationarity assumption [4, Chapter 9]. 
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If X,, Y,, and Z, are three time series, the problem of possibly misleading cor- 
relation and coherence values between two of them due to the influence on both 
of the third variable can be overcome by the use of partial cross-spectral 
methods. 

The spectral, cross-spectral matrix { f,(@)} = S(@) between the three variables 
is given by 


dz,(@) 
E| dz,(w) |[dz,(c) dz,(w) dz(w)] = { fi{o)} do 
dz,() 
where 
flo) = f(@) when i=j=x, 
= Cr(wm) when i=x,j=y, 
etc. 


The partial spectral, cross-spectral matrix between X, and Y, given Z, is found 
by partitioning S(œw) into components: 


T= | S41 S12 | 
S21 S22 


The partitioning lines are between the second and third rows, and second and third 
columns. The partial spectral matrix is then 


Diy = Sit Si2832 S321. 


Interpretation of the components of this matrix is similar to that involving 
partial correlation coefficients. Thus, the partial cross spectrum can be used to 
find the relationship between two series once the effect of a third series has been 
taken into account. The partial coherence and phase are defined directly from the 
partial cross spectrum as before. Interpretation of all of these functions and 
generalisations to the n-variable case can be found in [4, Chapter 5]. 


3. FEEDBACK MODELS 


Consider initially a stationary random vector X, = {X it, X2... Xu}, each 
component of which has zero mean. A linear model for such a vector consists of a 
set of linear equations by which all or a subset of the components of X, are “ex- 
plained” in terms of present and past values of components of X,. The part not 
explained by the model may be taken to consist of a white-noise random vector é,, 
such that 

Eee] = 0, tS, 


(3.1) 
= 1, Ls. 


where I is a unit matrix and 0 is a zero matrix. 


This content downloaded on Thu, 10 Jan 2013 11:28:16 AM 
All use subject to JSTOR Terms and Conditions 


CAUSAL RELATIONS 427 


Thus the model may be written as 


(3.2) AÁoX:ı = » A j;X4-j TE 
j=1 


J= 


where m may be infinite and the A’s are matrices. 

The completely general model as defined does not have unique matrices A; as an 
orthogonal transformation. Y, = AX, can be performed which leaves the form of 
the model the same, where A is the orthogonal matrix, i.e., a square matrix having 
the property AA’ = I. This is seen to be the case as n, = Ae, is still a white-noise 
vector. For the model to be determined, sufficient a priori knowledge is required 
about the values of the coefficients of at least one of the A’s, in order for constraints 
to be set up so that such transformations are not possible. This is the so-called 
“identification problem” of classical econometrics. In the absence of such a priori 
constraints, A can always be chosen so that the Ag is a triangular matrix, although 
not uniquely, thus giving a spurious causal-chain appearance to the model. 

Models for which A, has nonvanishing terms off the main diagonal] will be called 
‘“‘models with instantaneous causality.” Models for which Av, has no nonzero term 
off the main diagonal will be called “simple causal models.’’ These names will be 
explained later. Simple causal models are uniquely determined if orthogonal 
transforms such as A are not possible without changing the basic form of the model. 
It is possible for a model apparently having instantaneous causality to be trans- 
formed using an orthogonal A to a simple causal model. 

These definitions can be illustrated simply in the two variable case. Suppose the 
variables are X,, Y,. Then the model considered is of the form 


X, + boY, = > ajX,y—; + > bY, -j + &, 
j= 1 ‘= 
(3.3) f 
2 C;X,-; + > dY Gy 
j=1 


j=1 


Y, + CoX: 


If bo = co = 0, then this will be a simple causal model. Otherwise it will be a 
model with instantaneous causality. 

Whether or not a model involving some group of economic variables can be a 
simple causal model depends on what one considers to be the speed with which 
information flows through the economy and also on the sampling period of the 
data used. It might be true that when quarterly data are used, for example, a simple 
causal model is not sufficient to explain the relationships between the variables, 
while for monthly data a simple causal model would be all that is required. Thus, 
some nonsimple causal models may be constructed not because of the basic pro- 
perties of the economy being studied but because of the data being used. It has 
been shown elsewhere [4, Chapter 7; 3] that a simple causal mechanism can appear 
to be a feedback mechanism if the sampling period for the data is so long that 
details of causality cannot be picked out. 
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4. CAUSALITY 


Cross-spectral methods provide a useful way of describing the relationship 
between two (or more) variables when one is causing the other(s). In many realistic 
economic situations, however, one suspects that feedback is occurring. In these 
situations the coherence and phase diagrams become difficult or impossible to 
interpret, particularly the phase diagram. The problem is how to devise definitions 
of causality and feedback which permit tests for their existence. Such a definition 
was proposed in earlier papers [4, Chapter 7; 3]. In this section, some of these 
definitions will be discussed and extended. Although later sections of this paper will 
use this definition of causality they will not completely depend upon it. Previous 
papers concerned with causality in economic systems [1, 6, 7, 8] have been particu- 
larly concerned with the problem of determining a causal interpretation of simul- 
taneous equation systems, usually with instantaneous causality. Feedback is not 
explicitly discussed. This earlier work has concentrated on the form that the param- 
eters of the equations should take in order to discern definite causal relationships. 
The stochastic elements and the natural time ordering of the variables play relative- 
ly minor roles in the theory. In the alternative theory to be discussed here, the stoch- 
astic nature of the variables and the direction of the flow of time will be central 
features. The theory is, in fact, not relevant for nonstochastic variables and will rely 
entirely on the assumption that the future cannot cause the past. This theory will not, 
of course, be contradictory to previous work but there appears to be little common 
ground. Its origins may be found in a suggestion by Wiener [9]. The relationship be- 
tween the definition discussed here and the work of Good [2] has yet to be determined. 

If A, is a stationary stochastic process, let A, represent the set of past values 
{A,_;,j = 1,2,...,00} and A, represent the set of past and present values {A,- ;, 
j =0,1,..., oo}. Further let A(k) represent the set {A,_;,j = k,k + 1,..., co}. 

Denote the optimum, unbiased, least-squares predictor of A, using the set of 
values B, by P,(A|B). Thus, for instance, P,(X|X) will be the optimum predictor of X, 
using only past X,. The predictive error series will be denoted by ¢,A|B) = A, — 
P(A|B). Let o7(A|B) be the variance of ¢,(A|B). 

The initial definitions of causality, feedback, and so forth, will be very general in 
nature. Testable forms will be introduced later. Let U, be all the information in the 
universe accumulated since time t — 1 and let U, — Y, denote all this information 
apart from the specified series Y,. We then have the following definitions. 


DEFINITION 1 : Causality. Ifo?(X|U) < o?(X|U — Y), we say that Y is causing X, 
denoted by Y, = X,. We say that Y, is causing X, if we are better able to predict X, 
using all available information than if the information apart from Y, had been used. 


DEFINITION 2: Feedback. If 
a?(X|U) < o7(X|U — Y), 
a2(Y|U) < o?(Y|U — X), 


we say that feedback is occurring, which is denoted Y, <> X,, 1.e., feedback is said 
to occur when X, is causing Y, and also Y, is causing X;,. 
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DEFINITION 3: Instantaneous Causality. If o?(X|U, Y) < o?(X|U), we say that 
instantaneous causality Y, = Xis occurring. In other words, the current value of X, 
is better ‘“‘predicted’’ if the present value of Y, is included in the “‘prediction”’ than if 
it is not. 


DEFINITION 4: Causality Lag. If Y, = X,, we define the (integer) causality lag m to 
be the least value of k such that o7(X|U — Y(k)) < o7(X|U — Y(k + 1)). Thus, 
knowing the values Y,_;,j = 0,1,...,m — 1, will be of no help in improving the 
prediction of X,. 

The definitions have assumed that only stationary series are involved. In the 
nonstationary case, o(X|U) etc. will depend on time t and, in general, the existence 
of causality may alter over time. The definitions can clearly be generalised to be 
operative for a specified time t. One could then talk of causality existing at this 
moment of time. Considering nonstationary series, however, takes us further away 
from testable definitions and this tack will not be discussed further. 

The one completely unreal aspect of the above definitions is the use of the series 
U,, representing all available information. The large majority of the information in 
the universe will be quite irrelevant, i.e., will have no causal consequence. Suppose 
that all relevant information is numerical in nature and belongs to the vector set of 
time series YP = {Y!,ie D} for some integer set D. Denote the set {i e D, i # j} by 
D(j) and {Y!,ie D(j)} by YP, i.e., the full set of relevant information except one 
particular series. Similarly, wecould leave out more than one series with the obvious 
notation. The previous definitions can now be used but with U, replaced by Y, and 
U, — Y, by Y?. Thus, for example, suppose that the vector set consists only of 
two series, X, and Y, and that all other information is irrelevant. Then o7(X|X) 
represents the minimum predictive error variance of X, using only past X, and 
o*(X|X, Y) represents this minimum variance if both past X, and past Y, are used to 
predict X,. Then Y, is said to cause X, if 7(X|X) > o7(X|X, Y). The definition of 
causality is now relative to the set D. If relevant data has not been included in this 
set, then spurious causality could arise. For instance, if the set D was taken to 
consist only of the two series X, and Y,, but in fact there was a third series Z, which 
was causing both within the enlarged set D’ = (X,, Y,, Z,), then for the original set D, 
spurious causality between X, and Y, may be found. This is similar to spurious 
correlation and partial correlation between sets of data that arise when some other 
statistical variable of importance has not been included. 

In practice it will not usually be possible to use completely optimum predictors, 
unless all sets of series are assumed to be normally distributed, since such optimum 
predictors may be nonlinear in complicated ways. It seems natural to use only 
linear predictors and the above definitions may again be used under this assumption 
of linearity. Thus, for instance, the best linear predictor of X, using only past X, and 
past Y, will be of the form 


P(X|X, Y) = ¥ a;X,-; + ¥ bjY,-; 
, = 


j=l 


where the a,’s and b,’s are chosen to minimise o7(X|X, Y). 
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It can be argued that the variance is not the proper criterion to use to measure 
the closeness of a predictor P, to the true value X,. Certainly if some other criteria 
were used it may be possible to reach different conclusions about whether one 
series is causing another. The variance does seem to be a natural criterion to use in 
connection with linear predictors as it is mathematically easy to handle and simple 
to interpret. If one uses this criterion, a better name might be “causality in mean.” 

The original definition of causality has now been restricted in order to reach a 
form which can be tested. Whenever the word causality is used in later sections it 
will be taken to mean “‘linear causality in mean with respect to a specified set D.” 

It is possible to extend the definitions to the case where a subset of series D* of D 
is considered to cause X,. This would be the case if o°(X| Y?) < o7(X|Y?~™") and 
then Y”* = X,. Thus, for instance, one could ask if past X, is causing present X,. 
Because new concepts are necessary in the consideration of such problems, they 
will not be discussed here in any detail. 

It has been pointed out already [3] that instantaneous causality, in which know- 
ledge of the current value of a series helps in predicting the current value of a 
second series, can occasionally arise spuriously in certain cases. Suppose Y, > X, 
with lag one unit but that the series are sampled every two time units. Then al- 
though there is no real instantaneous causality, the definitions will appear to 
suggest that such causality is occurring. This is because certain relevant informa- 
tion, the missing readings in the data, have not been used. Due to this effect, one 
might suggest that in many economic situations an apparent instantaneous 
causality would disappear if the economic variables were recorded at more fre- 
quent time intervals. 

The definition of causality used above is based entirely on the predictability of 
some series, say X,. If some other series Y, contains information in past terms that 
helps in the prediction of X, and if this information is contained in no other series 
used in the predictor, then Y is said to cause X,. The flow of time clearly plays a cen- 
tral role in these definitions. In the author’s opinion there is little use in the practice 
of attempting to discuss causality without introducing time, although philosophers 
have tried to do so. It also follows from the definitions that a purely deterministic 
series, that is, a series which can be predicted exactly from its past terms such as a 
nonstochastic series, cannot be said to have any causal influences other than its 
own past. This may seem to be contrary to common sense in certain special cases 
but it is difficult to find a testable alternative definition which could include the 
deterministic situation. Thus, for instance, if X, = bt and Y, = c(t + 1), then X, 
can be predicted exactly by b + X,_, or by (b/c) ¥,_,. There seems to be no way 
of deciding if Y, is a causal factor of X, or not. In some cases the notation of the 
“simplest rule” might be applied. For example, if X, is some complicated poly- 
nomial in t and Y, = X,,,, then it will be easier to predict X, from Y,_, than 
from past X,. In some cases this rule cannot be used, as the previous example 
showed. In any case, experience does not indicate that one should expect economic 
laws to be simple in nature. 

Even for stochastic series, the definitions introduced above may give apparently 
silly answers. Suppose X, = A,_, + &, Y, = A, + 1, and Z, = A, + Y, where ¢,, 
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y,, and y, are all uncorrelated white-noise series with equal variances and 4A, 1s 
some stationary series. Within the set D = (X,, Y,) the definition gives Y, => X,. 
Within the set D’ = (X,, Y,), it gives Z, = X,. But within the set D” = (X,, Y,, Z,), 
neither Y, nor Z, causes X,, although the sum of Y, and Z, would do so. How is one 
to decide if either Y, or Z, is a causal series for X,? The answer, of course, is that 
neither is. The causal series is A, and both Y, and Z, contain equal amounts of 
information about 4,. If the set of series within which causality was discussed was 
expanded to include A,, then the above apparent paradox vanishes. It will often be 
found that constructed examples which seem to produce results contrary to 
common sense can be resolved by widening the set of data within which causality is 
defined. 


5. TWO-VARIABLE MODELS 


In this section, the definitions introduced above will be illustrated using two- 
variable models and results will be proved concerning the form of the cross spect- 
rum for such models. 

Let X,, Y, be two stationary time series with zero means. The simple causal model 
iS 

m m 
X= py ajX,-; + 2 bY, -j + &, 
J= J= 


(5.1) 


m 


Y= >) Xj DY aij +m, 
j=1 


j=1 


where £, n, are taken to be two uncorrelated white-noise series, 1.e., Elese] = 0 = 
Einm], s # t, and Efese] = 0 all t, s. In (5.1) m can equal infinity but in practice, of 
course, due to the finite length of the available data, m will be assumed finite and 
shorter than the given time series. 

The definition of causality given above implies that Y, is causing X, provided 
some b; is not zero. Similarly X, is causing Y, if some c; is not zero. If both of these 
events occur, there is said to be a feedback relationship between X, and Y,. It will 
be shown later that this new definition of causality is in fact identical to that intro- 
duced previously. 

The more general model with instantaneous causality is 


X, + bok = > a;X,—j + > D: Yei F G 
J=4 j=1 


(5.2) 


r + CoX, = > CX -j + X d;Y,-; + UPE 
j=1 


j=1 


If the variables are such that this kind of representation is needed, then instant- 
aneous causality is occurring and a knowledge of Y, will improve the “prediction” 
or goodness of fit of the first equation for X,. 
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Consider initially the simple causal model (5.1). In terms of the time shift operator 
U—UX, = X,_,-——these equations may be written 
(53) X, = a(U)X, + b(U)Y, + &, 

| Y, = c(U)X, + d(U)Y, + n, 


where a(U), b( U} c( U), and d(U) are power series in U with the coefficient of U° zero, 
i.e., a(U) = a;U!, etc. 


j= 1 “j 
Using the Cramer representations of the series, 1.e., 


x= f eaZ y=] azo), 
and similarly for ¢, and y,, expressions such as a(U)X, can be written as 


a(U)X, = | e@a(e~!) dZ (a). 
Thus, equations (5.3) may be written 
f ee = ale) dz,(@) — ble™™) dz) — dZ) = 0, 


f etl — cle) dZ,(w) + (1 — d(e~'”)) dZ w) — dZ,(w)] = 0, 


from which it follows that 
dZ. dZ, 
(5.4) A = 
dZ, dZ, 


where 
I =a =b l 
A= 
—c Il1-—d 
and where a is written for a(e~‘”), etc., and dZ, for dZ (œ), etc. 
Thus, provided the inverse of A exists, 


ji 7 pa 
(5.5) = Aq} l 
dZ, dZ, 


As the spectral, cross-spectral matrix for X,, Y, is directly obtainable from 


Ao | dZ, dZ] 
dZ, a 


these functions can quickly be found from (5.5) using the known properties of dZ, 
and dZ,,. One finds that the power spectra are given by 


fo) = ll — dla? + |bl’on), 
(5.6) 
F(@) = lao +|1—al’o;), 


This content downloaded on Thu, 10 Jan 2013 11:28:16 AM 
All use subject to JSTOR Terms and Conditions 


CAUSAL RELATIONS 433 


where A = |(1 — a)(1 — d) — bc|*. Of more interest is the cross spectrum which has 
the form 


Cr(@) = lll — d)to? + (1 — a@bo?). 


Thus, the cross spectrum may be written as the sum of two components 


(5.7) — Cr(w) = Clw) + Clo), 








where 
o2 
Ci(@) = PTAS — d)c 
and 
o2 
Clw) = rie — ab. 


If Y, is not causing X,, then b = 0 and so C (œ) vanishes. Similarly if X, is not 
causing Y, then c = 0 and so C,(q) vanishes. It is thus clear that the cross spectrum 
can be decomposed into the sum of two components—one which depends upon 
the causality of X by Y and the other on the causality of Y by X. 

If, for example, Y is not causing X so that C,(w) vanishes, then Cr(w) = Cw) 
and the resulting coherence and phase diagrams will be interpreted in the usual 
manner. This suggests that in general C ,(w) and C,(w) can each be treated separately 
as cross spectra connected with the two arms of the feedback mechanism. Thus, 
coherence and phase diagrams can be defined for X = Y and Y > X. For example, 


|Ci(@)|? 
JLo) fy(@) 
may be considered to be a measure of the strength of the causality X = Y plotted 
against frequency and is a direct generalisation of coherence. We call C3(@) the 


causality coherence. 
Further, 


Czw) = 


, Imaginary part of C,(w) 


zlo) = tan real part of C (œ) 


will measure the phase lag against frequency of X = Y and will be called the 
causality phase diagram. 

Similarly such functions can be defined for Y > X using C,(). 

These functions are usually complicated expressions ina, b, c,andd;forexample, 


os|(1 — d)c|? 
(oll — dl? + o7|b\*)(oz lel? + [1 — alto) 
Such formulae merely illustrate how difficult it is to interpret econometric models 


in terms of frequency decompositions. It should be noted that 0 < |C3(w)| < land 
similarly for Clw). 


Coo) = 
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As an illustration of these definitions, we consider the simple feedback system 


(5.8) X, = bY,-1 + €,, 
Y, = cX,—2 w Nt» 


where o2 = of = 1. 


In this case 


a(w) = 0, 
b(w) = be ®, 
c(w) = ce 7, 
d(w) = 0. 
The spectra of the series {X,}, { Y,} are 
fa) = 5+ * 


2nil ~ be e~ 79 |? 


and 
1 +c? 
2n|1 — be e7??? 


f(@) = 


and thus are of similar shape. 
The usual coherence and phase diagrams derived from the cross spectrum be- 
tween these two series are 


c? + b? + 2bc cos w 


CO) = AG e 


and 
csin 2m — b sin œ 


w) = tan~ t= 
Po) c cos 2m + b cos œw 


These diagrams are clearly of little use in characterising the feedback relationship 
between the two series. 
When the causality-coherence and phase diagrams are considered, however, we 
get 
2 2 
PE C5x(@) = a 
(1 + b*)(1 + c°) (1 + b*)(1 + c°) 


Both are constant for alla, and,ifb 4 0,c # 0,@5(w) = 2w(time lag oftwounits),’ 
b5z(w) = w (time-lag of one unit). 

The causality lags are thus seen to be correct and the causality coherences to be 
reasonable. In particular, if b = 0 then C33(@) = 0, i.e., no causality is found when 
none is present. (Further, in this new case, @5(~) = 0.) 


Czw) = 


1 A discussion of the interpretation of phase diagrams in terms of time lags may be found in Granger 
and Hatanaka [4, Chapter 5]. 
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Other particular cases are also found to give correct results. If, for example, we 
again consider the same simple model (4.8) but with o2 = 1,07 = 0, i.e., n, = 0 for 
all t, then one finds 


Cz(o) = 1, 
Crx(@) = 0, 


i.e., X is “perfectly” causing Y and Y is not causing X, as is in fact the case. 
If one now considers the model (5.2) in which instantaneous causality is allowed, 
it is found that the cross spectrum is given by 


1 
(5.9) Cro) = = Fld — aye — cojo; + (1 — a)(b — bo)az] 


where 
A’ = \(1 — a)(1 — d) — (b — bo)(c — co)’. 


Thus, once more, the cross spectrum can be considered as the sum of two compon- 
ents, each of which can be associated with a “causality,” provided that this includes 
instantaneous causality. It is, however, probably more sensible to decompose 
Cr(qw) into three parts, Cr(œ) = C,(@) + Calw) + Clw), where C,(m) and C,(@) 
are as in (5.7) but with A replaced by A’ and 


(5.10) C3(w) = eo — djo? + ba(l — ajo?) 


representing the influence of the instantaneous causality. 

Such a decomposition may be useful but it is clear that when instantaneous 
causality occurs, the measures of causal strength and phase lag will lose their 
meaning. 

It was noted in Section 3 that instantaneous causality models such as (5.2) in 
general lack uniqueness of their parameters as an orthogonal transformation A 
applied to the variables leaves the general form of the model unaltered. It is 
interesting to note that such transformations do not have any effect on the cross 
spectrum given by (5.9) or the decomposition. This can be seen by noting that 
equations (5.2) lead to 


pi pi 
A — 
dz, dz, 
with appropriate A. Applying the transformation A gives 
AA "i _ Al e 
dz, dz, 
dz, dz, 
= (AA) tA 
dz, dz 
jei Hi 
dz, 


so that 
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which is the same as if no such transformation had been applied. From its defini- 
tion, A will possess an inverse. This result suggests that spectral methods are more 
robust in their interpretation than are simultaneous equation models. 

Returning to the simple causal model (5.3), 


Xx, = a(U)X, + b(U)Y, + &, 
t— c(U)X, + dU)Y, + m, 


throughout this section it has been stated that Y, Æ X,ifb = 0.Onintuitive grounds 
this seems to fit the definition of no causality introduced in Section 4, within the set 
D of series consisting only of X,and Y,. If b = 0 then X, is determined from the first 
equation and the minimum variance of the predictive error of X, using past X, 
will be o2. This variance cannot be reduced using past Y,. It is perhaps worthwhile 
proving this result formally. In the general case, it is clear that 7(X|X, Y) = o2, i.e., 
the variance of the predictive error of X,, if both past X, and past Y, are used, will be 
o? from the top equation. If only past X, is used to predict X,, it is a well known 
result that the minimum variance of the predictive error is given by 


(5.11)  o07(X|X) = expr | log $x fw) do. 
It was shown above in equation (5.6) that 
— I 2 2 2 2 
fo) ~~ ITA (|1 d| O; + Ib] 07) 
where A = |(1 — a)(1 — d) — bc|?. To simplify this equation, we note that 
f log|1 — a e|? dw = 0 


by symmetry. Thus if, 

m1 — a, e|? 

"nll — Be?’ 

then o7(X|X) = a. For there to be no causality, we must have %) = 7. It is clear 
from the form of f,(@) that in general this could only occur if |b| = 0, in which case 
2nf,(w) = o2/\1 — al? and the required result follows. 


Feo) = a 


6. THREE-VARIABLE MODELS 


The above results can be generalised to the many variables situation, but the 
only case which will be considered is that involving three variables. 
Consider a simple causal model generalising (5.1): 


X, = a,(U)X, + b,(U)¥, + c,(U)Z, + Elto 
Y, = aa(U)X, + ba(U)Y, + c2(U)Z, + £21; 
Z, = a3(U)X, + b3(U)Y, + c3(U)Z, + E3 t> 
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where a,(U), etc., are polynomials in U, the shift operator, with the coefficient of U? 
zero. As before, ¢;,, i = 1, 2, 3, are uncorrelated, white-noise series and denote the 
variance £; = Of. 

Leta =a, — 1, $ = b, —1,y =c3 — 1, and 


a bi cy 
A=ja, B cj, 
az bz 7 


where b, = b,(e~ ™), etc., as before. Using the same method as before, the spectral, 
cross-spectral matrix S(qw) is found to be given by S(w) = A~'k(A’)~ ' where 


of 0 0 
k=|0 of 0 
0 0 ø 


One finds, for instance, that the power spectrum of X, is 
fw) = |A|~7Lot|By — c2b3|? + o3]c1b3 — yby|? + o3|bicz — cpl] 
where 4 is the determinant of A. 
The cross spectrum between X, and Y, is 
CP(@) = |A *[ot(By — c2b3)(c2a3 — yaz) + 03(c,b3 — biy)(wy — c1a3) 
+ 63(byCc2 — C1 B)(cya2 — C20). 
Thus, this cross spectrum is the sum of three components, but it is not clear that 
these can be directly linked with causalities. More useful results arise, however, 
when partial cross spectra are considered. After some algebraic manipulation it is 
found that, for instance, the partial cross spectrum between X, and Y, given Z, is 
CHo) = _ [ojo3b3a3 + 6303Baz + 0303610] 
fo) 
where 
fw) = o7|By — c2b3|* + o3|cyb3 — byyl|* + o3|bic2 — cB)’. 
Thus, the partial cross spectrum is the sum of three components 
CY) = CP? + CH? + CP” 
where 
ee {03303 
fio) © 
These can be linked with causalities. The component C= (œ) represents the inter- 
relationships of X, and Y, through Z,, and the other two components are direct 


generalisations of the two causal cross spectra which arose in the two variable 
case and can be interpreted accordingly. 


C= etc. 
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In a similar manner one finds that the power spectrum of X,, given Z, is 
o103|b3|° + o103/B|* + 02031b,|° 
fœ) 

The causal and feedback relationships between X, and Y, can be investigated 
in terms of the coherence and phase diagrams derived from the second and third 
components of the partial cross spectrum, i.e., 

Ices? 
tae 


Fx,2(@) = 


= 
coherence (xy, z) = etc. 


7. CONCLUSION 


The fact that a feedback mechanism may be considered as the sum of two causal 
mechanisms and that these causalities can be studied by decomposing cross or 
partial cross spectra suggests methods whereby such mechanisms can be in- 
vestigated. Hopefully, the problem of estimating the causal cross spectra will be 
discussed in a later publication. There are a number of possible approaches and 
accumulated experience is needed to indicate which is best. Most of these 
approaches are via the model-building method by which the above results were 
obtained. It is worth investigating, however, whether a direct method of estimating 
the components of the cross spectrum can be found. 


The University of Nottingham 
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